Guobin Shen

From Generic Correlation to Input-Specific Credit in On-Policy Self Distillation

May 12, 2026
Via arXiv

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information

May 12, 2026
Via arXiv

Wearable AI in the Era of Large Sensor Models

Apr 11, 2026
Via arXiv

Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning

Mar 04, 2026
Via arXiv

VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training

Feb 11, 2026
Via arXiv

Light Alignment Improves LLM Safety via Model Self-Reflection with a Single Neuron

Feb 02, 2026
Via arXiv

TEFormer: Structured Bidirectional Temporal Enhancement Modeling in Spiking Transformers

Jan 26, 2026
Via arXiv

Beyond a Single Light: A Large-Scale Aerial Dataset for Urban Scene Reconstruction Under Varying Illumination

Dec 16, 2025
Via arXiv

Efficient LLM Safety Evaluation through Multi-Agent Debate

Nov 09, 2025
Via arXiv

MVPBench: A Benchmark and Fine-Tuning Framework for Aligning Large Language Models with Diverse Human Values

Sep 09, 2025
Via arXiv